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REMARKS 

Applicants and the undersigned are most grateful for the time and effort accorded 
the instant application by the Examiner, The Office is respectfully requested to 
reconsider the rejections presented in the outstanding Office Action in light of the 
following remarks. 

The disclosure continues to be objected to because of a number of assened 
informalities. Equation 1 on Page 8 has been amended to conrect a minor typographical 
error. The status identifier for Qaim 2 has also been rewritten as "Previously F*resented", 
Thus, it is submitted these objections should be withdrawn. 

Claims 1 3 and 26 stand rejected under 35 USC 1 12, second paragraph, as being 
indefinite for failing lo particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. In particular, the Office is objecting to the variable C 
not being defined in the claim. Applicants respectfully traverse this rejection and assert 
the claims are not indefinite as C is understood by one of ordinary skiU in the art. Thus, it 
is submitted this rejection should be withdrawn. 

Claims 1-3, 6-12, 14-16, 19-25 and 27 stand rejected under 35 USC 103(a) over 
Goldenthal et al. in view of Newman et al. Specifically the Office asserted that "it would 
have obvious to one of ordhiary skill in the art at the time the invention was made to 
modify Goldenthal by specifically providing multiple phonetic detail levels and the 
corresponding process, as taught by Newman, for the purpose of increasing efficiency and 
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quality of a recognition system." (Office Action at 6) Reconsideration and withdrawal of 
the present rejection is hereby respectfully requested. 

The present invention broadly contemplates, in accordance with at least one 
presently preferred embodiment, the calculation of scores in such a way that the total 
likelihood is a weighted sum of the likelihood of all phonetic units at all levels of 
phonetic granularity (model grains), and that the weights are derived in such a way that 
the determination of the robustness and significance of the individual model grains is 
approached with emphasis, (Page 2, line 16 - Page 3, line 4) Given a structured model 
M(ij) for a speaker with 1 < i < L levels of detail and with 1 <7 < K(i) units on the i-th 
level, the score (as log-probability) for the utterance is calculated in each level separately, 
whereby explicit labeling information is used to identify the corresponding phonetic imii 
that is to be used on each leveh (Page 8» lines 5-8) As discussed in the specification, the 
number of units on each level and the number of levels may vary across speakers, since 
there might be less data available from certain speakers, entailing the necessity of 
omitting certain units altogether. (Page 10, lines 7-9) 

The previous comments regarding Goldcnthal are equally applicable here. As 
presently best understood, Goldenthal appears to be directed to a two-stage cohort 
selection technique used to reduce the equal error rate of a speaker verification process 
which validates the claimed identity of an unknown speaker, (Col. 2, lines 61-65) First 
the digitized signals of an unknown speaker seeking verification are compared with 
acoustic models corresponding to the claimed identification to determine **claimed" log 
likelihood scores. (Col. 5, lines 18-22) Then the same testing signals are Compared with 
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all of the cohort models to detennine cohort log likelihood scores and then a smaller 
subset of cohort scores is dynamically selected. (CoL 5, lines 23-30) The claimed scores 
and the dynamically selected scores are then presented to a validator which determines 
whether or not a threshold difference between the two scores is present. (CoL 5, lines 31- 
46) There is, however, no teaching or suggestion that the models used in calculating the 
various log likelihood scores have multiple levels of detail or that scores are calculated 
for each level of detail. In fact, the Office admits "Goldenthal fails to expressly disclose 
'capable of having a plurality of phonetic detail of varying resolution for each frame* and 
determining a likelihood value for 'each level' of the phonetic detail of target speaker 
model." (Office Action at 6) 

The previous comments regarding Newman are also equally applicable here. As 
presently best understood, Newman appears to be directed towards producing a speech 
model for use in determining whether a Speaker associated with the Speech model 
produced an unidentified speech sample. The speech model is produced without using an 
external mechanism to monitor the accuracy with which the contents were identified. 
(Col. 1, line 65 - CoL 2, line 1 1) As noted in the Office Action, each model of a word 
may be represented by a set of phonemes that represent the phonetic spelling of a word, 
(emphasis added) Furthermore, each phoneme may be represented by three sets of model 
parameters that correspond to the three nodes of the phoneme. (Col. 6, lines 26-29) 

The Office Action, however, takes the position that this suggests "that the system 
includes multiple levels of phonetic detail and the corresponding processing for each 
level, as claimed," {Office Action at 6) Contrary to the position taken in the Office 
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Action, however, there is no indication in Newman that each of the three sets of model 
parameters provides anything other than the same level of detail for each node of the 
phoneme. This is not, however, having multiple levels of phonetic detail in accordance 
with the present invention. 

The present invention is "capable of having a plurality of levels of phonetic detail 
of varying resolution for each frame**, (Qaim 1; emphasis added) However, as stated 
above, Newman appears to teach having multiple levels of phonetic detail for each word. 
Additionally, Newman appears to teach that each frame can be related to a particular 
phoneme node after disclosing that each word is represented by a series of phonemes. 
(Newman: column 3, lines 51-62). In the context of the instant invention, Newman 
appears to teach that each frame relates to one level of phonetic detail, 

A 35 U.S.C. 103(a) rejection requires that the combined cited references provide 
both the motivation to combine the references and an expectation of success. There is, 
however, absolutely no teaching or suggestion in Newman that would lead One of 
ordinary skill in the art to modify Goldcnthal to arrive at the present invention. 
Moreover, actually combining the teachings of Goldcnthal and Newman would not result 
in the in the present invention which requires specifically ''providing a model 
corresponding to a target speaker, the model being resolved into at least one frame and 
capable of having a plurality of levels of phonetic detail of varying resolution for each 
frame" and "determining, for each frame and each level of phonetic detail of the target 
speaker model, a likelihood value; and resolving the at least one likelihood value to 
obtain a likelihood score.'' (Claim 1; emphasis added) Similar language appears in the 
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other independent claims. This hierarchical approach in wMch there are a plurality of 
levels of phonetic detail with varying resolution, determining a likelihood value for each 
level, and then using the likelihood values to deteunide a likelihood score is simply not 
taught or suggested by either Goldenthal or Newman or the combination of the two. 

Applicants acknowledge that Qaims 4-5, 13, 17-18 and 26 were indicated by the 
Examiner as being allowable if rewritten in independent form. Applicants reserve the 
right to file new claims of such scope at a later date that would still, at that point, 
presumably be allowable. 

In view of the foregoing, it is respectfully submitted that Claims 1, 14 and 27 fully 
distinguish over the applied art and is thus are in condition for allowance. By virtue of 
dependence from what are believed to be allowable independent Claims 1 and 14. it is 
respectfully submitted that Claims 2-13 and 15-26 are also presently allowable. 
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In summaiyt it is respectfully submitted that the instant application, including 
Claims 1-27, is presently in condition for allowance. Notice to the effect is hereby 
earnestly solicited. In the unlikely event, however, it appears the claims will not be 
allowed, the Office is invited to call the undersigned to discuss the claims prior to the 
issuance of a further Office Action. 



Respectfully submitted. 
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